Do Query Optimizers Need to be SSD-aware?
نویسندگان
چکیده
Flash-based solid state disks (SSDs) are beginning to supplant conventional rotating disks for performance-critical data in myriad DBMS applications, including decision support systems. Though SSDs provide the same block-oriented storage abstraction as conventional disks, their performance characteristics differ drastically. Whereas SSDs provide relatively modest improvements in sequential transfer rates (e.g., perhaps 2× improvement), they can provide over 100× improvement for random reads, resulting in similar sustained transfer rates regardless of the access pattern. Conventional query optimizers assume a storage cost model where sequential IOs are far less costly than random IOs, and select access paths and join algorithms based on this assumption. Given the drastic change in SSD performance characteristics, intuition suggests that optimizer cost models must be updated (e.g., to prefer non-clustered index scans more frequently). Surprisingly, our empirical investigation using a commercial DBMS finds it is not necessary to adjust query optimization when shifting relations from disk to flash—an SSD-oblivious optimizer generally makes effective choices. We make two main observations. First, we demonstrate both empirically and analytically that the range of selectivities for which an unclustered index scan can benefit from SSDs’ fast random reads is so narrow that it is inconsequential in practice. Second, our measurements show that the performance variations across alternative join algorithms on SSDs are generally smaller than the corresponding variation on disks and are dwarfed by the 5× to 6× performance boost of shifting data from disk to SSD. We conclude that existing query optimizers largely make correct decisions even when treating all storage devices as conventional disks, and the small and inconsistent performance gains available by making query optimizers SSD-aware are not worth the effort.
منابع مشابه
Prototyping a Vibrato-Aware Query-By-Humming (QBH) Music Information Retrieval System for Mobile Communication Devices: Case of Chromatic Harmonica
Background and Aim: The current research aims at prototyping query-by-humming music information retrieval systems for smart phones. Methods: This multi-method research follows simulation technique from mixed models of the operations research methodology, and the documentary research method, simultaneously. Two chromatic harmonica albums comprised the research population. To achieve the purpose ...
متن کاملA Study of Execution Plan Aware Mutations for Genetic Cyclic Query Optimization
Resumen— The increasing number of applications requiring the use of large join queries reinforces the search for good methods to determine the best execution plan. Specially, when the number of joins is too large to be calculated by a traditional optimizer. Previous literature describes Genetic optimizers that may yield invalid execution trees that have to be repaired. Most of them use non-data...
متن کاملCLARINET: WAN-Aware Optimization for Analytics Queries
Recent work has made the case for geo-distributed analytics, where data collected and stored at multiple datacenters and edge sites world-wide is analyzed in situ to drive operational and management decisions. A key issue in such systems is ensuring low response times for analytics queries issued against geo-distributed data. A central determinant of response time is the query execution plan (Q...
متن کاملBuffer-pool Aware Query Optimization
With the advent of 64-bit processors, large main memories are set to become very common. This in turn translates to larger buffer pool configurations in database servers. Query optimizers however, currently assume all data is disk resident while optimizing queries. This assumption will no longer be valid when buffer pools become 100’s of gigabytes in size. In this paper we examine how data pres...
متن کاملSpecifying Rule-Based Query Optimizers in a Reflective Framework
Numerous structures for database query optimizers have been proposed. Many of those proposals aimed at automating the construction of query optimizers from some kind of speciication of optimizer behavior. These speciication frameworks do a good job of partitioning and modularizing the kinds of information needed to generate a query optimizer. Most of them represent at least part of this informa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011